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Yalduted "using the Kolmogorov Smimov test. The results 
fobiained from these teste proved that the proposed model is 
staisially valid 
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1. Introduction 


In India, around 26% of commuters are traveling for about 90 minutes every day. Although the 
average commuting length is 29 minutes in India, a large group of commuters travels more than 
‘an hour to and from the place of work [1]. Thus, congestion becomes a persistent problem in the 
Indian traffic scenario due to the heterogeneous traffic, lack of discipline, ete. Therefore, 
exacting ravel-time information for individual modes separately becomes a tedious task. In this 
study, authors have attempted to use public transport ([e., buses) as probe vehicles to estimate 
the travel time of the test mode (i. car) for an arterial corridor. 


‘The current study chose public transit (bus transit) as the probe mode; because there will be no 
prevalence of privacy issues in using the public buses. The probe vehicles ean be fited with GPS, 
receivers and can be easily tracked without much interference. Also, by using the bus asa probe 
‘mode, it is possible to acquire a large Sample of data asthe buses may take multiple runs on the 
arterial corridor in day. 


tis a well-established fact thatthe majority of the developed nations like the United States and. 
European countries prefer private transport over public transport for better accessibility [2-6]. 
However, the scenario is different in developing nations like India, China, ete., where the 
‘majority of people rely on public transport than private vehicles [7] In this study, authors have 
‘considered public transit (bus) as the ideal choice for probe vehicles. As public transport follow a 
regular pattern such as predefined route and schedule, it is easier to track them and use them as 
probe vehicles [8]. Thus, the bus is chosen as a viable probe mode for model development. 
However, there are some typical shortcomings in publi transport travel-time calculations 


Issues involving public transit travel-time estimation are 
1) The public transport buses vary in travel speeds and transfer times along diferent routes [9]. 


2) In certain eases of missing information on the schedule, the average travel speeds ae assumed 
forthe whole route [10-12] 


3) In most cases, the transfer waiting times are ignored or assumed as a constant2, 13] 


4) Very few studies have incorporated scheduled arrival or departure times for travel time 
analysis [9] 


Some ofthe exiting studies have used buses asthe probe vehicles and calculated the travel-time 
for the other modes. They have conducted a correlation analysis between the two modes (bus and 
‘other modes like a ear, two-wheeler, auto, ete) and developed linear regression models. The idea 
‘of developing non-linear models using machine learning algorithms was limited for mode-wise 
travel-time estimation. Moreover, very limited studies have included dvvell time in their models. 
Existing research has reported few simulation-based studies too and real-time based Field 
scenario studies were very limited. There are many factors affecting travel-time; some are 
categorized as trafic factors (distance, speed, travel-time, interseetion length), road geometry 
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related fictors (road width, gradients) and vehicle characteristis (bus stop halt time, passenger 
count). It is very difficult to develop model that includes all these factors together and very 
limited research is identified where all these factors are considered together. 


‘The above issues are addressed in this study. This study considers the non-linear relationship 
between the two modes (Car and Bus) using ANN and SVM. Transit buses as probes offer a 
‘number of advantages: (1) covering a large portion of urban networks; (2) GPS are installed by 
transit operators making the model cost-effective; and (3) privacy concerns can be eliminated by 
installing GPS devices in public transit systems (buses). However, there are certain drawbacks in 
using buses; for instance, the characteristics of the buses differ slightly in comparison to the 
‘other vehicle characteristics. The difference between the bus travel-time and the average travel 
lime of the stream is a random variable and modeling them for other modes is great challenge. 
Despite these randomness and biases involved in obtaining bus travel-time data itis considered, 
as the best option a a probe vehicle for other Vehicles in the traffic stream, This study includes 
the modeling bus characteristics sueh as bus slop dwell time by considering parameters like 
passenger count and dwell time at individual bus stops. 


‘The traditional tavel-time prediction models in the literature are broadly classified into four 
categories: Historical average models [14-17], Statistical models [18-21], Model-based 
approaches [22-27] and Machine learning models [28-35]. Among these, Machine learning 
‘models are gaining popularity as these advanced models are well suited for supervised and 
unsupervised learning, This paper involves mode-wise travel 

corridor using Machine learning techniques (ANN and SVM). 


time estimation along an arterial 


A suitable study corridor of $9.48 kilometers in length was chosen for this study. In this study 
corridor, there are 19 bus stops; the positional coordinates (latitude and longitude) of the bus 
stops were collected using a hand-held GPS during the field survey. The whole cortidor was 
segmented into 18 successive segments based on the 19 successive bus stop locations. The 
proposed travel-time estimation method is developed in two stages. In the first stage, the segment 
travel-time data of bus and car along with other parameters such as segment length, average 
speed at segments, intersection length and signal timings (ifthe intersection falls within the 
segment), bus stop dvell time, passenger boarding and alighting (if the bus has stopped at the 
‘bus stop within the segment) were given as inputs to the neural network for training. A back 
propagation algorithm was used to train the model. In the second stage of development, the data 
is firstly classified based on the road geometry (number of lanes and gradients). Thus, the 


complete corridor data was segreyated based on lanes -single lane to six lanes; and further, the 


data were categorized using gradients as a binary classifier in the SVM modeling process, The 
classified data along with the other parameters (Length of the segments, bus trvel-time, bus stop 
details such as halt time and passenger count, intersection length and signal time (red-time) were 
‘modeled using the SVM. The model parameters chosen for the ANN and SVM models were 
‘optimized using the K-fold cross-validation technique. For mode-wise travel-time modeling, a 
deeper understanding of the existing prediction models is required. The following sections 
provide a literature review on existing prediction models used for travel-time prediction, 
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2. Literature review on travel-time predi 


n models 


‘The travel-time prediction models can be categorized as statistical and simulation models 
istical models are: Regression models [15,36-39], Time series models [40-44], 
Kalman-filtesing models [17,24,32.42,48,46], ANN models [17,30,32,34,47-SO], Support Vector 
Machines(31,34,35,50-S4], Hybrid models [29,31,51,55,56]. Among these wide ranges of 
prediction models, Machine learning models have gained popularity. Machine learning models 
are capable of performing non-linear modeling; and also these are considered as appropriate 
techniques for handling complex data. is evident from the extensive literature that ANN is 
being widely used for bus arrival time prediction because of its ability to solve complex 
‘nonlinear relationships [17,32,34,50], To account for the nonlinear nature oftravel-time data for 
prediction, Kisgyorgy and Rilet [57] suggested modifications such as clustering techniques, 
‘modular neural network, expanded input nodes, and spectral basis neural network to ANN. Chien 
at al. [42] used integrated adaptive neural network algorithms to identify the prediction error in 
rwal-time; the bus travel-time prediction is assessed with a microscopic simulation model 
(CORSIM), Ishak and Aleesandru [58] used multiple topologies of the dynamic neural network 
to optimize the short-term travel-time prediction. However, existing studies [28,34,50,53,54] 
claimed that SVM is advantageous over ANN for short-term prediction. Zhong et al. [12] 
suggested assigning higher weights to the important variables can reduce the outliers and 
minimize their impact. Muller et al, [53] suggested that kernel algorithms are more efficient for 
travel-time prediction. However, the application of an accurate prediction methodology for 
‘mode-wise ravel-time prediction is considered as a potential research challenge for researchers. 


Various st 


‘The first work attempted in using public transit as probes for estimating the travel time of other 
‘modes was pioneered by Bue et al [59], The study used buses as probes for car travel-time 
«estimation. They had used the simple regression and ANN (artificial neural network) methods for 
analysis and found that the ANN method outperformed the simple regression methods. The study 
considered only historical data, and real-time prediction was not attempted. Hall and Vyas{60] 
‘compared bus probe data with automobile trajectories and the study resulted in analyzing the 
delays between buses and automobiles are interrelated. The analysis revealed that longer delays 
of automobiles also resulted in delays on buses traveling on the same route. Bertini and 
“Tuntiyanugulehai (61) developed tavel-time es 

by eliminating bus stop dwelling time. The study by Bertini et al. [62] compared the time- 
distance diagrams of different types of bus trajectories with the time-distance diagram of ears. It 
was found that pseudo bus trajectories are able to explain the car travel-time, But the study was 
tested for the morning peak period alone, 


imation models between automobiles and buses 


Most of the research fom existing literature have used either linear regression modeling or a 
‘combination of two or more methods for mode-wise travel-time prediction, It can be seen tha 
the combination of Kalman filer and ANN were predominantly used in literature. Chakraborthy 
et al, [36] developed simple regression equations that used buses as probes to determine 
automobile travel-time, This method was not successful in predicting travel 

Padmanabhan et al (63] estimated the bus travel-time in a linear relationship by incorporating 
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the dwell time into consideration. Kumar et al. [64] on contrary estimated the tavel-time for 
different modes with respect to buses used as the probe vehicle where the bus travel 
correlated o other modes by removing the dwell times at bus stops with associated acceleration 
and deceleration. In their study, two methods were proposed: (1) regression analysis and (2) ratio. 
‘method to analyze mode-wise travel-ime estimation. Shalaby and Farhan [65] proposed a bus 
travel-time prediction model using the AVL (Automatic Vehicle Location) and APC (Automatic 
Passenger Counter) data to model bus travel-time. The performance of the model was tested 
using a micro-simulation tool - Vissim. A Kalman filter was fitted to eheck the accuracy of the 
‘model. Jeong and Rilet [48] studied three different models to prediet bus arrival time. Historical 
data-based models, Regression Models, and Artificial Neural Network (ANN) models were used: 
and it was found that ANN models outperformed the other two. In this research, the AVL data 
was used and developed a model considering the dwell time and schedule adherence. Essaway et 
[66] used bus tavel-time data to estimate general link-tavel-times of neighbor (nearby) links. 
In this study, regression models were developed to relate bus travel-times to general link trvel- 
limes. Consequently, the estimated link trvel-times were used to calculate the travel-times of 
neighboring links. A'summary of existing studies that have used the probe vehicle technique for 
travel-time estimation is detailed in Table 1. From Table Lit is evident that in general regression 
‘models, Kalman filters and ANN are commonly used for mode-wise travel-time estimation. Both 
regression models and Kalman filters ean perform linear modeling. But, travel-time is a dynamic 
parameter and the factors affecting travel-time are also very complex and dynamie in nature. In 
‘order to model such complex parameters, developing a non-linear model is required, 


In onder to estimate the travel-time in the arterial corridor, an appropriate prediction 
‘methodology should be considered. The prediction model should be capable of capturing and 
‘modeling the relationship between various travel-time affecting parameters for mode-wise 
conditions. Machine learning models are gaining popularity as these advanced models are well 
suited for supervised and unsupervised learning. This paper involves mode-wise travel-time 
estimation along an arterial cortidor using Machine learning techniques (ANN and SVM). The 
‘main reasons to choose ANN and SVM for mode-svse tavel-lime prediction are 


(1) They are parametric models and suitable for handling complex data; 
(2) These methods are very efficient in modeling and deriving relationships involving bi- 
‘modal (bus and car) parameters. Travel-time is one such parameter that is influenced by 
many other fietors such as traffic and roadway characteristies. For example, ravel-time 
prediction at intersections is very challenging as the signal timings are mostly dynamic 
tnd actuated; such multi-level parameter modeling can be efficiently performed using 
‘machine learning models, 
ANN models have the ability to model complex parameters with repetitive iterations 
using randomized 


i 


ts and bias values 


SVMs have the capability of generalizing the data and it is also possible to achieve global 
‘minima for given training data. Further, ANN and SVM are highly optimized models 10 
approximate the results between the actual and predicted values to some extent 
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‘Table 1 


“Traveltime estimation using probe vehicle technique. 


“Author 


“Technique 


Key Findings 


Bac [67] 


"ANN, gression 


ANN outperformed regression 


Hall and Vyas (60] 


Compared ie 
luajectories of bus and 


The delays oT uses and her modes ae Ter TAT, 
Delay in automobiles consequently results in the 
delay of buses ina link, 


Cathey and Dailey 
2a) 


Kalman filer 


“The stad resulted in understanding conan 
between travel-time obtained from probes and loop 
detectors, 


“Tantganagalchar and 
Bertini [ot] 


Bus and car trajectories 


“The wavel times of atomobies and bases can Be 
correlated by eliminating bus stop dwelling tie. 


‘Chakraborty and 
Kikuchi [68 


Regression 


Proved buses can be used as probe vehicles 
determine automobile travel time 


Jeong and Rit [69] 


Tso at ST 


models, egresion and 
arficil neural network 


The ANN models outperformed Historica and 
Regression models. The ANN model was developed 
considering the dll ime and schedule adherence. 


Bin eal. [54] 


SVM and ANN 


The study Wavel me using the input variables, 
results indicated tht the SVM model outperformed 
ANN model 


Padimanaban oa 
[7o) 


Kalman fier 


Estimated travel time of bases Tacorporatng Oe Bar 
stop dwell time 


Kieu etal. (71) 


Regression 


The model was suitable for off peak hours 


Erawey and Sayed 
161 


Regression and Vissim 


"Estimated car tavel-time with an etrr of 176% 


han eal. [72] 


Tink waveline 
cstmation using MNL. 
model 


The ak travel 1 estimated By miming The 
ctor between expected path tnvel-time and observed 
path tavel-time 


‘Vasantha Kumar and 
Vanajashi (64) 


Regression and 
correlation ratio method 


Estimated travel time between automobile and ines 
by eliminating bus stop dwell ime with acceleration 
nd deceleration at bus stops 


row et 159] 


Regression analysis and 
Frequency distribution 


Bis arival prediction using a Sha card sytem THE 
‘mode calculated us arval time incorporating the 
passenger alighting and smart card swiping time 


Athin and Stinson 
v3) 


Regression analysis 


The number of passengers lighting, passenger 
boarding. umber of acces approaches and 
sgnilized intersections was identified as significant 
rameters in bus artval ime estimation 


‘Sharmila al (74) 


Hybrid model using 
SVM-PF 


‘Used a Bimodal modeling for buses and cam 
Estimated MAPE Value tained was 9.96 (car) and 
11.24 (bus) 


Kumar tal. [46] 


‘Time-space 
disretzation using 
Kalman-fiter 


1 speed based walls sir models was developed 
using the Godunov scheme. Kalman filter was used 
for prediction. Kalman filter method outperformed 
ANN. Regression and Historical average 


Zhang etal. (33) 


Pattern matching 


“The Spatial-lemporal traffic paters are matched Tor 
‘multi-step tavel-ime forecasting 
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3. Study corridor 


‘The chosen study corridor for this study consists of busy arterial roads of the Mumbai city 
covering a stretch of $9.48 kilometers. This study corridor is considerably long connecting the 
Jogeshwari —Vikhroli link road (10.6 km), the Western Expressway (25.33 km) and the Eastern 
Expressway (23.55 km) forming a triangular corsidor around the ITT Bombay campus. The 
proposed study corridor contains nineteen bus stops and nine signalized intersections. Figure La, 
shows the selected study corridor for this study. The data collection process for this study was 
performed by collecting real-time travel-ime data fram the Field forthe two modes- car and bus 
using onboard hand-held GPS (E-Trex 10) as shown in Figure 1.b. The data were collected for 
five days (Monday-Friday) for both the modes bus and car for the morning peak period (from 
8.00 am to 11.00 am. 


Fig, Lb, Data collection using hand-held 
GPS 


Fig, La, Stuly cord. 


3.1, Study parameters: 
In order to achieve ear travel-time using the bus as probes, enormous factors were considered for 
‘modeling the relationship beween them, Out of which certain important parameters were 
‘grouped into two min categories such as traffic parameters (distance, speed, tay 

intersection length) and road-way characteristics (road width, gradients bus stop halt time, 
passenger count, etc). Table 2 gives the details of the variables used in this study, and these 
parameters were selected based on literature review and Field conditions adaptable to the study 


corridor. 
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Variable Description 
Tas stop 

based The study coridor was segmented using the identified bus top waypoints 
segments 


“The Taverne Tora Was wed a GTCUaTe We aReE TRE 


Tegmenis Wa The 


Dine ‘help of the latitude and longitude positions obtained from the GPS data. 
EEE Segment rvs ime dita forthe car was obtined wing the GPS enabled inthe car 
anc T STATS TEBE Wa TTT TG He BOTT PSOE 

time daring dat collection 
TH TRESS on The TRE Were ard on Te ld ARTS TCR was ET 
Intacction "from he ld and compared with he Google distance xlclatorfo check forthe 
Tenth sccrcy ofthe messied interaction ent an stop time forthe ed eel at aos 
intentions nas determined from th field ring te night vey condi. 
Grabear Uae season nmaon sae te GPS aie pa ne 
SS 
Lancs as observed tht the cord contains snl lan ix nes tien asc of 
the road. The enh of th ro seth forever ane wa aust 
Takame TST ve ine was manly counted t ach ne te Bas topped at bas ae 
sng hae held GPS and stot 
Pasangat The mio o passongers boarding and aljhing he Bus al Sv Bus Sop wa 
Count ed during the uve. 


‘The above-listed parameters in the table are chosen as the input parameters for both ANN and 
SVM modeling. These parameters are considered as the most important attributes which affect 
‘corridor level ravel-time estimation. 


4, Methodology 


‘This study is oriented in determining the travel-time of different modes on the arterial coridor 
using buses as the probe vehicles. The idea of mode-wise travel ime estimation is very less used 
‘due to the various complications involved in modeling when considered for two different modes 
‘Therefore, this study aims at modeling mode-wise travel time by including all the relevant 
parameters that are influential with respect fo the two modes (bus and car). Fig? illustrates the 
‘overall methodological framework of the proposed model. The basic interpretation of the 
proposed study is given as follows: 


CIP = s(BTT.tL.Gr,DWT_PCSPV.SG) a 
Where, 


Desired output: CTT= Car travel times Inputs for the model: BIT= Bus travel time: IL. = 


Intersection length; Gr = Gradients; DWT: Dwell time; PC = Passenger count; SPV = Speed 
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Tnput data for modeling 


— 


SVM "ANN 
Normalizing the data and Choosing the taining 
perform cross validation Tinction and hidden 
for optimizing parameters aiead 
“raining the SVR model Dividing the data into 

with tuned model appropriate training, 

Parameters validating and testing 
: i 
Predicting the ravel time Using the trained data for 
using trained model ing the model 


Fig.2. Overall methodological ramework 


4.1 ANN model development 


Although the basic taining procedures of ANNs are almost similar, the aceuracy of the result i 
_reatly dependent on the type of inputloutput combinations 


An artificial neural network mathematical model is writen a 
y= f(U)=W, tanh +0 +8) +b, @ 
‘Where y i the output of the neural network model 

Uisacolumn vector of size p that cantains the p inputs ofthe process 


Wo is a row vector of size n that contains the weights of the neural network model from the 


hidden layer tothe output; 


Wi is a matrix that contains the weights of the neural network model from the inputs to the 
hidden layer. This matsx has n rows and p columns: 


Bi is a column vector of size n that contains the biases from the input to the hidden layer of 
the neural net model: 
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bo is the bias (scalar) from the hidden layer tothe output ofthe neural net model: and 
‘anh is the activation function (in this study hypesbolic tangent function is used, 


‘The activation functions are needed to introduce the nonlinearity into the network. They 
determine the non-linear relationship between the input and the output layers. There are many 
activation functions used in practice (For example, piecewise linear, Gaussian and sigmoidal 
functions). The sigmoidal functions such as logistic and hyperbolic tangent functions tan h) are 
the most common choices. Sigmoidal functions such as tanh or are tan produce both positive and 
negative values tend to yield faster training than the other functions such as logistie functions 
that produce only positive values [7S]. Hence, in this study, the tanh funetion is used to scale 
inputs and targets to (-1, 1), 


“The activation function for tanh function is given by the following formula 


z cy 


‘The curve for tanh is very similar to sigmoid withthe only real diference lies in output range for 
both activation funetions. The tanh Function ean map input values toa range between -1 and I. It 
is centered at 0. Like the logistic sigmoid, the tanh function is also sigmoidal ("s”-shaped), but 
instead outputs values that range (-1, 1) Thus strongly neyative inputs to the tanh will map t0 
negative outputs. Additionally, only zero-valued inputs are mapped to near-zero outputs. 


4.2. Training procedure: back propagation algorithm, 

Backpropagation is a powerful algorithm that is used to train the multilayer perceptron and 
‘obtain the weight of each link. These synaptic weights enable the network to move closer to the 
desired response by frequent iterations. A most common measure of the error in the 
backpropagation is the mean square erro, 


Msi 


\ 
13 o.-n? my 
Where 9, ithe predicted valu and 3s tbe observed value 


‘The backpropagation algorithm is the most popular algorithm for transportation use{14] [16 
Hence, inthis study, authors have resorted to using the backpropagation algorithm. 


43, Network architecture 


Existing literature consists of different types of ANN architectures for forecasting. purposes. 
However, the multi-layer perceptron has gained popularity among all other structures [75]. In 
this study, an optimal ANN model was developed using different combinations of network 
architecture. The ANN (ij,k) indicates a network architecture with ,j and k neurons inthe input, 


hhidden and output layers respectively. The hidden and output layers are responsible forthe actual 
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processing in the network. As part of the taining process, the weights and bias parameters are 
‘generated in the hidden layer. The nodes of the hidden layer capture the pattern ofthe data and 
perform a nonlinear mapping between the input and output variables, The MATLAB tool was 
used for determining the training sets, training ANNs and initializing the values of weights and 
biases using trial and error approach and optimizing for the best fit model, Table 3 and Table 4 
show the weights and biases values ofthe trained networks, 


‘Tables 
Weights ofthe uained network 
ue iT 
K 
i Ts a tee 1) Age 


1 1a8 1683-175 156 1031-176 141 0.4993 2.005 
2 2439-142 2.209 1725-011 1317-227 0.833 1.822 
3 LMA L908 0.781172 2720, 1.904 1.132.745 1.065 


12, isthe weight between neuron of layer and of the neuron af the previous yer 


‘Tables 
“The bias ofthe trained network 

3B 7 

fy ties ae ote Sr See BS 


1 291 218 1496 072 0 072 -1aS 218 2912 
2003 AS 029-118 0.07 -028 -0.12-0.14 0,006 


is he bias oF neuron of the layer 


‘The weights and bias values were estimated as deliverables of the ANN modeling using the 
following equations 
we Says 6) 


the weights; « : Lagrange multipliers: «are the inputs and the bias values are ealeulated 


o 


Where $ isthe support vectors 
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44, Data modelling using ANN 


In this study, authors are attempting to predict car travel-time using the buses as probe vehicles. 
tis appropriate to segregate the data as segments based on bus stop locations to understand the 
Variation between the travel-time of buses and other modes (car). Hence for this study, bus stop 
locations were identified using GPS in the field. The GPS data for cars and buses were 
segmented based on these bus stop locations. There are 19 bus stops within the study corridor 
Which implies that there are 18 segments in total for ANN modeling. The input parameters 
considered for the ANN modeling are (I) length of the segments, bus travel-time, and bus stop 
details such as halt time and passenger count, intersection length and eycle time (red-time). The 
car travel-time is the desired output of the ANN model. These parameters were modeled in ANN 
using MATLAB software. The complete five days of segmented data were used as a training and 
testing data forthe ANN model containing the input parameters as described above. The model 
used 80 % of the data for taining purposes and the remaining data for testing and validation 
[17.54]. The number of neurons used forthe model was 20 (optimized after several iterations). 
‘backpropagation algorithm was used for training the model. The number of epochs used for this, 
model was S00 (maximum limit obtained in the model). Car travel-time was obtained as the 
predicted outcome of the model. Fig.. illustrates the details of the ANN algorithm and its 
specifications. 


‘Specifications 


‘eainga Noval Nework 
finah‘iiny | 


4 


ep 2 
viding the Data in Test an Validation eatos 
dividePara ination 


Inpuc (Bus-ave speed, Distance Bus-TT 
intesecon length, us sop actos, signal 
‘ining 


Ouput (Car) 


No. OF put: & 
Noo Output: livideParan alain 


No.of Epc: $00 livideParamtesation 


b. 


Sup 3 
Data using ANN 


‘Output Anal Tani 


‘Than (ANN Nam 


lput Data Output Daa 
Step 4: 


eting Output of ANN, 
"ANN, Naming 


Observe: 3602.97 


Priced 37501 


Abolute Eee: 15.25 


MAPE Value 11.65 


Fig. 3. ANN algorithm and specifications 
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45, Support vector machine: mathematical formulation 


SVMs are leaming systems that generalize on a hypothetical space of Hinear functions on a 
higher dimensional feature space using a trained learning algorithm [76]. The SVMs are used in 
Siting a hyper-plane to the training data through which the data is classified into two classes. The 
hyperplane acts as a decision surface asa margin of separation between the positive and negative 
classes. With these classes, the hyper-plane is obtained by maximizing the margin between the 
hyperplane and the closest training samples, called the “Support vectors” [77]. The support 
vector machines work on the principle thatthe data points which are non-linearly separable are 
transferred to a higher dimension (D) with the help of kerel funetions to make them linearly 
separable in a higher dimension scale. In the higher dimension, a hyperplane is constructed 
between data points fo maximize the margin of separation. For additional details on the general 
concept of SVM, see Vapnik [78], Burges [79], and ScholkopT and Smola [80], 


Ina linear SVM model, the general SVM equation is given as 


rs 


seb o 
Where w is weight vector and b is bias, (x) isthe function associated withthe hyperplane 
Let us assume the taining data D has a set of n points as given by [77] 


> 


of eR tbat é 


 elongs to -1 of 1, to which the point * belongs to in a d- dimensional feature space, 


In eases, where the data points are not linearly separable, Cortes and Vapnik [81], using a 
‘modified SVM algorithm by adding a soft margin [77]. A saruple form of tke SVM funetion in a 
higher dimensional space, as illustrated in [76]: 


sw 


wig la) = godt, 
7 0 


Where represents the hhigh dimensional feature spaces. By adopting the Lagrangian 


‘multiplier method, the parameters such as “+ and » can are estimated. The details of 
‘optimization can be obtained 


ap thcs) 
2 uo) 
aijent HOSE te 


“The factor C and the slack variable 


inthe previous equation 
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“The regularization parameter C takes into account the mislead points and maintains the shape of 
the function and the slack variable® measures the degree of misclassfication ofthe data points 


Finally the maximum margin in the hyperplane after Lagrange minimization is given as below: 


fl) = Darks) +b 
ay 


Where, “is the kernel function and “is the Lagrange multipliers, n= set of support 
vectors 


In the eases where the data is not linearly separable, the kernel trick is applied to transform the 
feature space to. higher dimension space where the data is linearly separable, 


‘One common example of kernel function isthe Gaussian radial basis function 94 \s,.,) given by 
Rakshita etal 82} 


te) 


Where, is the bandwidth ofthe kernel. In the present study, since the classes were overlapping, 
We used a sofl-margin SVM with RBF kernel 


4.6, Hyper-plane creation criteria 


‘The GPS data was classified based on the road geometries such as a number of lanes and 
igadients. The complete corridor data was segmented based on the change of lanes observed 
lroughout the study corridor. As per the Field observations, it was identified that the whole 
corridor consists of single, two lanes, three lanes, four lanes, five lanes, and six lanes. Hence the 
study corridor was segmented based on the changes in the road width during the vehicle 
‘movement and the GPS data of bus and car was segregated based on lanes. Also, the gradients on 
the road were also determined through the elevation data obtained from GPS. Later the gradient 
(of the road was used as a binary classifier to train the model using SVM. The negative 

Were labeled as 0 and the postive gradients were labeled as 1 This binary classification helped 
the SVM model to classify the data based on the gradients, and a hyper-plane was created to 
form the boundary hetween the negative and postive gradients, 


radients 


4.7, Model development using SVM 
Inthe second stage of modeling the data using SVM, a k-fold cross-validation technique was 
used in order to optimize the essential parameters and minimize the bias between the taining and 
testing data, For this study, a radial basis function was used as a Kemel function. The gamma g, 
cost function e and epsilon values taken for the model are g~2, e=256, e=0.1. The gamma and 
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cost function parameters used in the model were optimized using the K-fold eross-validation 
technique. The predicted output of the model isthe car travel-time, The selection of parameters 
for the model is an important step in deciding the model's accuracy. The performance and 
consistency of the model highly depend on the model parameters used in the model. For this 
study, the data was validated in two stages; intially, authors have performed analysis and 
Validated data using the normal tran/test split method (dividing the entire dataset into two sets; 
‘one traning set and another testing se). The advantages of using this method are: (1) it trains 
faster than other existing methods like cross-validation ete; (2) als, it is simpler to examine the 
detailed results of the testing process. However, this method does not account for the biases 
involved in the spliting of training and testing data. Therefore, to bring homogeneity in results 
for the model developed, the traning and testing data was validated using the Kk-fold cross 
validation procedure. The five days data was taken for training the model, and the k-old cross- 
Validation method was used in selecting the optimum parameters (Cost funetion and gamma). 
tons used in SVM modeling are responsible for the mapping of input data into 
feature space. Popular choices of kernel function include radial basis function (RBF), linear 
kkemel, and a polynomial Kernel function. However, inthis study RBF kernel is used; because 
BF is highly effective in mapping nonlinear relationships. There are two parameters for an RBE 
kernel: Cost function (C) and Gamma (y). The C and Gamma values are usually chosen as user 
perceived values. There is no standard method available to determine the optimus values forthe 
C and gamma parameters. Thus, itis essential to identify a method to optimize the parameters 
such thatthe model classifier can accurately predict unknown data (ie. testing data). For solving, 
this problem, common strategies (known as K-fold cross-validation) were used in this study. In 
the k-fold cross-validation, asthe first step, the training data was divided into k subsets of equal 
size, Sequentilly, one subset is tested using the classifier trained on the remaining k-1 subsets, 
‘The complete set of five days data was divided into five equal partitions (i, k equal to 5). Out 
of five days of data, four days of data were used as training data (Day 2, 3, 4, and 5) and the 
remaining one-day data was used as the testing data (Day 1). Then the least ervor was calculated 


Kemet fu 


for each segment of the day -1. The optimum C and Gamma values were obtained from the k= 
fold cross-validation technique. A detailed algorithm with specifications of the SVM model is 
sven in Figure 4 


5. Results and discussions 


‘The results obtained from the developed ANN and SVM algorithms were compared with the 
‘observed travel-time data obtained from the field. The prediction was earried out from the five 
days of data collected for both bus and car modes. After the data is trained using the ANN and 
SVM models, then the next set of data was used for validation; and the output derived is the 
predicted data obtained as the result of the trained model. In this paper, authors have used the bus 
and car data as inputs for taining and obtained the predicted results of ear data with respect to 
‘buses used as probe vehicles. The prediction accuracy was measured in terms of the MAPE 
(Mean Absolute Percentage Error), MAD (Mean Absolute Deviation) and RMSE (Root Mean 
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* 


Square Error) values. Figures $ and 6 show the prediction results for ANN and SVM in 


‘comparison with the original field observed values 


‘Specifications | 


Inpuc [Bus-avg speed, Distance, Bus-TT 
intsection length, bus sop fics, signal 
tining) 


Ouput (Car) 


No. Oflapu:§ 


No of Output: 


‘Outpt aval 
Obre 360297 i 
Prods: 379501 i 
Abolte Eoe:1086 


MAPE Value: 10.78 


Fig. 4 Support Vector Machines 
Mean absolute percentage errar (MAPE) 


‘The wo models we 


MAPE value =+ (2) 100 


izing the parameters with eros validation 


4 seagate the dat ima numberof! 
1 tnee and gradintsas +1 and-1 | 


Seep 
Cronin an stance mats al abe vector 
ing Liber options 


J 


Ses 
Scaling the data sig Libs tol box and 


Seep 
Using SVM command predicting the ouput 
hua (test data) ro the waned data 


£ 


Predicted ouput Carwavettine | 


algorithm and specifications 


-valuated based on the MAPE, MAE and RMSE values 


~The observed value, y’ - predicted value, n number of observations, 


‘Mean average deviation (MAD) 


(3) 


i) 
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‘Root mean square error (RMSE) 


us) 


», js the observed value; y, isthe predicted value and m isthe total number of segments 


‘Observed versus Predicted travel-tim 


‘SVM model (Day-1) 


~ Mota ilies 


Fig. 5. Observed versus predicted trave-time: SVM model 


Observed versus Predicted travel-time - ANN model 
(ay -1) 


‘hitcaaill intial | 


[obenedtvel ine iret rave ine 
Fig, 6, Observed versus predicted tavel-time: ANN model 


‘A weighted percentage error graph for the models developed (ANN and SVM) are plotted in 
Figure 7. Also, a comparison plot of the speed profiles for the different days is given in Figure 8. 
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‘Table § gives the staistical accuracy measures for the predicted data for the ANN and SVM. 
‘models, The exact percentage accuracy of the model was tested using very common error 
‘measures namely, the mean absolute deviation, root mean square error and the mean absolute 
percentage error 


Weighted Percentage E 


Fig. 7. Weighted percentage er graph 
Box plot comparison for speed data 
Fig. 8. Comparison grap for spesd profile 
‘Tables 
Measures of percentage accuracy fr the model developed. 
‘A measure of accuracy (Day 1) ANN model SVM model 
MAPE (“e) 1165 1078 
MAD (s) 1306 985 
RMSE (s) Ise 108 


Standard error(s) 1525 10.86 
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11 is inferred from Figures 5 that the actual values and the predicted values are almost close 
‘except for certain locations (segments) where the deviation in the values may be due to varying 
trafic congestion ata different locaton on different days. The MAPE values for ANN and SVM 
models ae identified as 11.65 and 10.86 respectively. According to Lewis" scale of intexpretation 
‘of estimation accuracy [83], any forecast with a MAPE value of less than 10% ean be considered 
highly accurate, 11-20% as good, 21-5 

the results obtained are reasonable forall with MAPE of les than 15%, 


M% as reasonable, and 51% or more as inaccurate. Thus, 


6. Statistical validation of the proposed models using Kolmogorov-Smirnov 
test (K-S test) 


‘The Kolmogorov-Smirmov tes is used as a statistical evaluation tool to compare the observed 
sample distribution and theoretical sample distribution, The cumulative distribution Function for 


‘a variable witha specified distribution ean be comps 
Square and joint t-tests that the travel-times follow the normal distribution. ere inthis study, the 
Kolmogorov-Smimov test is used as a tool to evaluate the differences in the cumulative 
disiibution forthe observed and the predicted values obtained from the ANN and SVM models. 
‘The null hypothesis assumes no difference hetween the observed and theoretical distribution, 


sing this test, It was observed from ehi- 


Acceptance Criteria: If the calculated value is less than the eritical value, accept the null 
hypothesis. 


Rejection Criteria: Ifthe ealeulated value is 
hypothesis, 


eater than the table (critical) value, reject the null 


‘The test resulls ofthe Kolmogorov-Smimov test conducted for the observed and predicted values 
are depicted in the form of a cumulative Frequency graph in Figures 9 and 10, Table 6 gives the 


test results for the Kolmogorov-Smirnov test conducted 


‘Table 6 
Kolmogorov Smimov (K-S) test. 


Kolmogorov Smimov Calculated value Tabulated value n value Alpha value Null hypothesis 


ANN model 028142 0309 005 ‘Accepted 
SVM model oos7on os7s 2 ons Accepted 


‘The max D value obtained for cumulative differences between the observed and predicted values 
for the ANN model is 0.028142 and the critical value obtained from the table is 0.309 (with a 
degree of freedom as 18 and alpha value as 0,05). Similarly, for the SVM model, the cumulative 
differences obtained are 0,047011 and the critical value obtained from the table is 0.375 (with a 
degree of freedom as 12 and alpha value as 0.05). It is observed that for both the models, the 
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caleulated values are much lesser than the critical (table) values indicating that there are no 
‘major variations between the observed and predicted travel-times at a 95% confidence level 
Hence the null hypothesis is accepted, 


K-S Test graph for ANN 


© asap semen 
ipa CoE edited COF 
Fig 9 Camultive diferences graph Tor the K-S test for ANN model 
K-S Test graph for SVM 


ginal CDF) Prodi CDF 


Fig. 10. Cumulative differences graph forthe K-S tes for SVM model 


7. Summary and conclusions 


‘The present study carried out a detailed analysis of travel-time and it's affecting parameters for 
‘mode-wise ravel- time estimation, This study was conducted with a mative to predict mode-wise 
travel-time as itis considered advantageous for collecting individual data for all the modes 
‘Therefore, this method solves the purpose of extensive data collection, which is tedious for a 
heterogeneous traffic condition. In this paper, authors have attempted to find the ear travel-time 
considering buses as probe vehicles using ANN and SVM models. Iti evident from the results 
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that corridor level travel-time ean be estimated more efficiently by segmenting the corridor into 
bus stop sections and modeling using the Machine learning models. From the results, it can be 
seen that the ANN and SVM models give a reasonable MAPE value of 11.65 and 10.78 
respectively. It is observed that the oblained values are much lower compared to the models 
developed for travel-time estimation for Indian conditions. The significance of this study is 
‘modeling the travel-time affecting parameters and deriving a non-linear relationship model 
between the two modes. This study ineludes certain mode characteristics such as bus stop dwell 
lime, passenger count, etc. The possibilty of applying machine learning models and deriving an 
‘optimized parameter modeling is one of the research highlights ofthis study. Also inthis study. 
the support vector machines are used as both classification and prediction tools. Thus, these 
‘models developed are highly optimized and efficiently used for mode-wise prediction under 
Indian traffic conditions, This study also includes the statistical evidence for validating the 
proposed model using the Kolmogorov-Smirnoy test, It is observed that there are no major 
Variations between the observed and predicted travel-times at a 95% confidence level 
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